CDS

Accession Number TCMCG021C03586
gbkey CDS
Protein Id XP_010909364.1
Location 79592..80983
Gene LOC105035494
GeneID 105035494
Organism Elaeis guineensis

Protein

Length 463aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010911062.3
Definition putative L-cysteine desulfhydrase 1 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category E
Description Isopenicillin N
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00782        [VIEW IN KEGG]
KEGG_rclass RC00382        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K22207        [VIEW IN KEGG]
EC 4.4.1.28        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00270        [VIEW IN KEGG]
map00270        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATCCCCACCATGATCATCACGCCGAGAACGGCCGTCATGACGGCGACGGCTGCGGCGACGACACCAACGGCCCCCTCCCGAAGCGGCCGCGGCCGTCGCCATCCCACATCTCCGCCGCCGAGATCCGCGACGAGTTTTCCCATCACGACCCCGCCGTCGCCCGCATCAATAACGGCAGCTTCGGCAGCTGCCCGGCCTCCGTCCTCGACGCCCAGCTCCGATGGCAGCGTCTTTTCCTCTGCCAGCCCGACGACTTCTACTTCAACCGCCTCCAGCCCTCTCTCCTCCGCTCCCGCGCCATCATCAAAGACCTCATCAACGCCGACGACGTCGAGGAGGTCTCCCTCGTGGACAACGCCACCACTGCCGCCTCCATCGTCCTCCAGCACGTTTCGTGGGCCTTCACTGAGGGCCATTTCAAGAAGGGCGACGCCGTCGTCATGCTCCACTACGCCTACGGCGCCGTCAAGAAGTCCATCCAGGCCTACGTTACCCGTGCCGGCGGCCATGTTATCGAGGCTCCCCTCCCCTTCCCTGTGACCTCCAACGAAGAGATCGTTCAAGAATTCCGGAAGGCGTTGGAGCTTGGGAAGTTCAACGGTCGGAACGTCCGGCTGGCTGTAATCGACCACATTACCGCGATGCCGAGCGTCCTCATCCCTGTTAAAGAATTGATCAAGATTTGCCGGGAGGAAGGTGTAGACAAGGTGTTTGTCGATGCTGCGCATGCAATTGGGAGCGTCGAGGTCGACATGAAGGACATTGGGGCTGATTTCTACACCAGCAACCTCCACAAGTGGTTCTTTTGCCCGCCATCGGTTGCATTCTTGTACTCCAAGAAGTGCTTGGCCTCGTCTGACTTGCACCACCCGGTGGTCTCACATGAGTATGGGAATGGACTCCCAATGGAGAGCGGGTGGATTGGCACCCGGGATTACAGCGCCCAGCTCGTAGTGCCATCAGTGATGGATTTCATTAGTAGGTTTGAAGGAGGAATTGAAGGAATTTGGAAGAGGAATCATGATAAGGTAGTGGAGATGGGGAAGTTGCTGGCCAAGTCATGGGGCACTTGTCTTGGGTCACCCCCGGATATGTGCCCAAGTATGATCATGGTTGGTCTACCAGGATGCTTGGGAATTTCAAGTGAGAAGGATGCTCAGAAGTTTAGGAGCCTCTTGAGGGATCAATTCCATGTTGAGGTTCCTGTATATCATCAGTCTCCAAAGGATGGTGAGAACGACAATCCGGATCAGAGCAGTTCTGTGACTGGGTATGTGAGAATTTCTCATCAGGTCTATAATGTGGAAGATGATTACATTAGACTCAGGGATGCGATCAACAAACTTGTTCATGATGGATTCAACTGCACCATGCTGTCATCCAGTTAG
Protein:  
MDPHHDHHAENGRHDGDGCGDDTNGPLPKRPRPSPSHISAAEIRDEFSHHDPAVARINNGSFGSCPASVLDAQLRWQRLFLCQPDDFYFNRLQPSLLRSRAIIKDLINADDVEEVSLVDNATTAASIVLQHVSWAFTEGHFKKGDAVVMLHYAYGAVKKSIQAYVTRAGGHVIEAPLPFPVTSNEEIVQEFRKALELGKFNGRNVRLAVIDHITAMPSVLIPVKELIKICREEGVDKVFVDAAHAIGSVEVDMKDIGADFYTSNLHKWFFCPPSVAFLYSKKCLASSDLHHPVVSHEYGNGLPMESGWIGTRDYSAQLVVPSVMDFISRFEGGIEGIWKRNHDKVVEMGKLLAKSWGTCLGSPPDMCPSMIMVGLPGCLGISSEKDAQKFRSLLRDQFHVEVPVYHQSPKDGENDNPDQSSSVTGYVRISHQVYNVEDDYIRLRDAINKLVHDGFNCTMLSSS